Decision Trellis Models for Tuple Categorization in Databases

نویسندگان

  • Paolo Frasconi
  • Marco Gori
  • Giovanni Soda
چکیده

We introduce a probabilistic graphical model for supervised learning on databases with categorical attributes. The proposed graph contains hidden variables that play a role similar to nodes in decision trees and each of their states either corresponds to a class label or to a single attribute test. As a major diierence with respect to decision trees, the selection of the attribute to be tested is probabilistic. Thus, the architecture can be used to assess the probability that a tuple belongs to some class, given the predictive attributes. The training algorithm can be easily derived in the general framework of graphical models, using expectation-maximization (EM) for nding the optimal parameters. We propose decision trellises as an alternative to decision trees in the context of tuple categorization in databases, which is an important step for building data mining systems. Preliminary experiments on some standard databases are reported, comparing the classiication accuracy of decision trellises and decision trees induced by C4.5.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

2-tuple intuitionistic fuzzy linguistic aggregation operators in multiple attribute decision making

In this paper, we investigate the multiple attribute decisionmaking (MADM) problems with 2-tuple intuitionistic fuzzylinguistic information. Then, we utilize arithmetic and geometricoperations to develop some 2-tuple intuitionistic fuzzy linguisticaggregation operators. The prominent characteristic of theseproposed operators are studied. Then, we have utilized theseoperators to develop some app...

متن کامل

A Hybrid Multi-attribute Group Decision Making Method Based on Grey Linguistic 2-tuple

Because of the complexity of decision-making environment, the uncertainty of fuzziness and the uncertainty of grey maybe coexist in the problems of multi-attribute group decision making. In this paper, we study the problems of multi-attribute group decision making with hybrid grey attribute data (the precise values, interval numbers and linguistic fuzzy variables coexist, and each attribute val...

متن کامل

Deriving Spatiotemporal Relations from Simple Data Structure

A spatiotemporal data model is incomplete without three components: classes, consistency constraints, and operators. Classes define the structure of the model, constraints enforce consistency in the model, and operators operate on the structure of the model. In the past, many models have been proposed, but most of them discussed the classes. The studies on operators for spatiotemporal data mode...

متن کامل

An attribute or tuple timestamping in bitemporal relational databases

Much of the research on bitemporal databases has focused on the modeling of time-related data with either attribute or tuple timestamping. While the attribute-timestamping approach attaches bitemporal data to attributes, the tuple-timestamping approach splits the object’s history into several tuples. Although there have been numerous studies on bitemporal data models, there is no work contrasti...

متن کامل

$k$-tuple total restrained domination/domatic in graphs

‎For any integer $kgeq 1$‎, ‎a set $S$ of vertices in a graph $G=(V,E)$ is a $k$-‎tuple total dominating set of $G$ if any vertex‎ ‎of $G$ is adjacent to at least $k$ vertices in $S$‎, ‎and any vertex‎ ‎of $V-S$ is adjacent to at least $k$ vertices in $V-S$‎. ‎The minimum number of vertices of such a set‎ ‎in $G$ we call the $k$-tuple total restrained domination number of $G$‎. ‎The maximum num...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996